Abstract

Expression of the light-harvesting complex protein genes (Lhc) is under the control of a circadian clock. To dissect
the molecular regulatory components of the circadian clock, a promoter deletion analysis of four tomato Lhc genes
was performed in transgenic tobacco plants. The important 5'-upstream promoter regions are present at different
positions relative to the transcription start sites of Lhc b1*1, Lhc b1*2, Lhc a3 and Lhc a4. A short sequence of 47
nucleotides is necessary for conferring circadian Lhc mRNA oscillations. Sequence alignment of the specified
promoter regions revealed a novel motif, 'CAANNNNATC'. This motif is conserved in 5'-upstream regions of
clock-controlled Lhc genes and overlaps with a sequence relevant in phytochrome-mediated gene expression.

Abbreviations: Lhc a/b, genes encoding light-harvesting complex proteins (formerly cab, chlorophyll a/b-binding
proteins); DD, continuous darkness; LD, light/dark conditions



Circadian rhythms have been described in nearly every eukaryotic organism as well as in some
prokaryotes [19]. Despite the universal appearance of such rhythms and the accumulated knowledge
about them, the molecular mechanism of the circadian clock is still not understood. Evidence is
accumulating that the transmission of the biological clock occurs by feedback and autoregulation
[28, 4]. To shed some light on the basic machinery, we started to investigate possible components
necessary for the circadian mRNA accumulation of the light-harvesting complex proteins (Lhc) of
plants.

The light-harvesting complexes of plants are organized as protein-pigment units in the thylakoid
membranes of chloroplasts, which enhance the probability of light quantum absorption and the
focused channelling of the energy to the photosynthetic reaction centres [1, 16]. The proteins
and pigments of these complexes and their respective genes have been intensively investigated in
plants. Besides several important findings, it turned out that the synthesis of the proteins and
the accumulation of the respective mRNAs are under the control of a circadian clock [24, 27].
Interestingly, all 19 members of the tomato Lhc gene family exhibit this characteristic
expression pattern [18]. Based on this similarity, a common control mechanism, for example cis-
and trans-regulatory elements, is expected to function at the level of transcription of each
tomato Lhc gene.

Several distinct cis-regulatory motifs have been detected for plant genes [22, 29]. A prominent
element is the 'ACGT' motif, the G-box, first described by Giuliano et al. [13], which is present
in the promoters of plant genes encoding very different proteins [31]. However, the trans-acting
factors binding to this motif belong to the same bZip type [3, 11, 15]. Another sequence
('CCTTATCAT') has been described and correlated with light-responsive expression of several plant
genes [14], as has the 'GATA' motif (I-box). The latter, described by Castresana et al. [8], is
present in the promoters of genes for several light-harvesting complex proteins (LHCP) [12, 25]
and for the small subunit of Rubisco [9], and in the CaMV 35S promoter [21]. In many Lhc gene
promoters this motif is repeated two to four times. The nucleotide distances between the 'GATA'
sequences are highly conserved [25]. Since the discovery of this motif and its abundant
appearance, a role as a regulatory unit has been postulated. Variation of the nucleotide sequence
of this motif indicated that the 'GATA' element modulates transcription positively rather than
negatively [12]. A correlation of light- and/or tissue-specific expression with the presence of
this motif was not observed. A possible function in mediating circadian rhythmicity of Lhc mRNA
accumulation was discussed, since a short promoter fragment of the Lhc b1*1 (cab 2) gene of
Arabidopsis thaliana containing this motif confers circadian Lhc expression in transgenic plants
[2]. In addition, the sequence 'ACTT' flanking the 'GATA' sequence in tomato was suggested to be
a component of the signal transduction chain of the circadian clock [6].

Figure 1. Steady-state mRNA levels of the tomato Lhc b1*1 (cab 1A) gene in transgenic tobacco. A. Transgenic tobacco plants carrying
the deleted tomato Lhc gene promoter (Δ-1091, Δ-701, Δ-159, Δ-43) were grown in light/dark (light: 06:00-18:00) and continuous dark
conditions. Leaves were harvested at indicated time points and tomato Lhc mRNAs were detected with the primer extension technique. The
length of the extended primer is 70 bp. B. mRNA levels of deletion constructs were quantitated (fmol per mg RNA) and data of representative
tobacco lines are presented.

Figure 2. Steady-state mRNA levels of the tomato Lhc a3 (cab 8) gene in transgenic tobacco. Transgenic tobacco plants carrying the deleted
tomato Lhc gene promoter (Δ-322, Δ-278, Δ-231, Δ-148) were grown in light/dark (light: 06:00-18:00) and continuous dark conditions.
Leaves were harvested at indicated time points and tomato Lhc mRNAs were detected with the primer extension technique. The length of the
extended primer is 102 bp.

Figure 3. Steady-state mRNA levels of the tomato Lhc b1*2 (cab 1B) gene in transgenic tobacco. A. Transgenic tobacco plants carrying the
deleted tomato Lhc gene promoter (Δ-793, Δ-323, Δ-152) were grown in light/dark (light: 06:00-18:00) and continuous dark conditions.
Leaves were harvested at indicated time points and tomato Lhc mRNAs were detected with the primer extension technique. The length of the
extended primer is 45 bp. B. mRNA levels of the deletion constructs were quantitated (fmol per mg RNA) and data of representative tobacco
lines are presented.

Figure 4. Steady-state mRNA levels of the tomato Lhc a4 (cab 11) gene in transgenic tobacco. The Lhc a4 promoter regions (-251 to +4 or
-119 to +4) were cloned into the pBI101.2 reporter vector from Clontech. Transgenic tobacco plants containing the constructs were grown in
light/dark (light: 06:00-18:00) and continuous dark conditions. Leaves were harvested at indicated time points and the glucuronidase mRNA
levels were detected by northern blot analysis. The length of the gus mRNA is ca. 1.9 kb. For abbreviations, see Figure 5.




Figure 5. Summary of the deletion analysis. Circadian oscillations of tomato Lhc mRNAs in transgenic tobacco are indicated by the 'white'
clock. No rhythmic Lhc mRNA accumulations were detected in tobacco lines transformed with the tomato Lhc b1*1 Δ-159 and Δ-43 and the
Lhc a3 Δ-231 and Δ-148 deletion constructs; these are indicated by the 'shaded' clock. T, 'TATA'; C, 'CCAAT'; I, 'GATA'; L, 'CCTTATCAT';
G, 'ACGT'; 10nucmotif, 'CAANNNNATC'.



Since all members of the tomato Lhc gene family express the typical circadian mRNA accumulation
pattern, a computer-based search was initiated to screen for a conserved cis-regulatory element
in the 5'-upstream sequences. This analysis was performed with long DNA sequences (e.g. 400
nucleotides) but failed to identify a candidate that may be responsible for the mRNA oscillations
[25]. Therefore the primary goal of the presented experiments was to explore the 5'-upstream
sequences of the tomato Lhc genes by promoter deletion analysis in transgenic tobacco plants.

Promoter deletion analysis 

Tobacco leaf discs were transformed via Agrobacterium tumefaciens-mediated gene transfer with
promoter deletion constructs of four tomato Lhc genes: Lhc b1*1 (cab 1A; accession numbers
M14445, M30616; promoter: X60922), Lhc b1*2 (cab 1B; M14443; promoter: X60923), Lhc a3 (cab 8;
X15258) and Lhc a4 (cab 11; X57706). About fifty primary transformants of each deletion construct
were regenerated and tested for Lhc mRNA expression levels. Individual plant lines with high
expression levels were chosen for further analysis.

To find out which of the deleted promoters mediate the circadian oscillations of tomato Lhc mRNA,
the tobacco transformants were grown in LD and DD conditions. Leaves were harvested at
appropriate time points and steady-state mRNA levels were determined by northern blot (Figure 4;
for a description of methods, see [24]) or primer extension (Figures 1A, 2, 3A) analysis and
quantitated by primer extension analysis (Figures 1B and 3B; for a description of methods, see
[18, 25]). The mRNA levels of the tomato Lhc b1*1 deletions Δ-1091 (Nos. 4, 11, 38) and Δ-701
(Nos. 4, 9, 26) clearly oscillate in LD with a period of ca. 24 h. Only very little or no tomato
Lhc mRNA is detectable at 06:00, maximum mRNA levels are reached at 10:00 and levels decrease
thereafter (Figure 1A and B). The phase of the circadian rhythm is not altered in the different
deletion constructs. In constant darkness the amplitudes are significantly reduced, but the fact
that transcripts increase after the night trough at the appropriate time may indicate that an
endogenous oscillator influences the expression of the tomato Lhc gene.

Table 1. Sequence motif presence in 5'-upstream regions of Lhc genes.
* Sequence overlaps with the 'CAAT' box.

In contrast to the long promoter constructs, it is very unlikely that the tobacco lines with the
short promoter regions of Lhc b1*1 Δ-159 (Nos. 2, 42, 47) and Δ-43 (Nos. 24, 28, 33, 34) exhibit
circadian mRNA accumulation patterns (representative data in Figure 1). The mRNA levels fluctuate
only twofold at low expression levels; however, a constant period length of ca. 24 h was not
measured in LD and DD, which is particularly well documented after quantitation (Figure 1B, right
panels). These results clearly show that the short 5'-upstream regions are sufficient for a basal
mRNA accumulation and strongly suggest that sequences upstream of -159 are necessary for
circadian mRNA accumulation of the tomato Lhc b1*1 gene. Furthermore, it is interesting to note
that the expression levels of the short constructs do not decrease in DD, while a decrease is
usually observed for complete Lhc genes in their native 3' and 5' nucleotide surroundings.

With the Lhc a3 deletion constructs, results similar to those for Lhc b1*1 were obtained
(Figure 2). Deletions Δ-322 (No. 6) and Δ-278 (No. 14) exhibited circadian Lhc mRNA accumulation,
while the mRNAs of the deletion constructs Δ-231 (Nos. 24, 36) and Δ-148 (Nos. 16, 25) reached
almost constant levels in LD and DD and no oscillations with a defined period length could be
observed. Based on these data it is likely that a region of 47 nucleotides (-278 to -231) is
necessary for circadian Lhc a3 mRNA oscillations. The low expression levels of the Lhc a3 gene in
transgenic tobacco prevented quantitation by primer extension analysis.
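
The reasoning used to delimit a necessary region from deletion end points can be written down as a short sketch. The function name `necessary_region` and the dictionary layout are illustrative assumptions, not part of the original analysis; the end points and rhythmic/non-rhythmic calls are the ones reported above for Lhc a3 and Lhc b1*1.

```python
def necessary_region(results):
    """Infer the promoter interval required for rhythmicity.

    results maps the 5' end point of each deletion construct (negative
    integer, relative to the transcription start site) to True if the
    construct showed circadian mRNA oscillations, False otherwise.
    Returns (upstream_end, downstream_end, length_in_nt), or None when
    the constructs do not bracket a region (all rhythmic or none rhythmic).
    """
    rhythmic = [pos for pos, ok in results.items() if ok]
    arrhythmic = [pos for pos, ok in results.items() if not ok]
    if not rhythmic or not arrhythmic:
        return None
    shortest_rhythmic = max(rhythmic)     # closest to the start site, still rhythmic
    longest_arrhythmic = min(arrhythmic)  # longest construct that lost the rhythm
    if longest_arrhythmic <= shortest_rhythmic:
        raise ValueError("inconsistent deletion series")
    return (shortest_rhythmic, longest_arrhythmic,
            longest_arrhythmic - shortest_rhythmic)


# Deletion series as reported in the text (hypothetical variable names).
lhc_a3 = {-322: True, -278: True, -231: False, -148: False}
lhc_b1_1 = {-1091: True, -701: True, -159: False, -43: False}

print(necessary_region(lhc_a3))    # (-278, -231, 47)
print(necessary_region(lhc_b1_1))  # (-701, -159, 542)
```

For a series in which every construct is rhythmic, the function returns `None`, which corresponds to the case where the relevant sequence lies within the shortest construct tested.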

Investigation of the Lhc b1*2 (Figure 3) and Lhc a4 (Figure 4) genes in tobacco revealed
circadian Lhc mRNA accumulation for constructs that contain long as well as short 5'-upstream
regions (Lhc b1*2: Δ-793, Nos. 11, 22; Δ-323, Nos. 13, 17; Δ-152, Nos. 1, 4, 21, 24; Lhc a4:
Δ-251, No. 60; Δ-119, Nos. 37, 74). The mRNA levels of the Lhc b1*2 constructs could be
quantitated by primer extension analysis and representative results are depicted in Figure 3B. A
circadian oscillatory pattern is obvious for each deletion construct. It should be noted that the
short 5'-upstream regions of Lhc b1*2 and Lhc a4 are apparently sufficient to confer circadian
rhythmicity. This is surprising, since comparably short promoter regions of the Lhc b1*1 and
Lhc a3 genes exhibit an almost constant expression pattern (Figures 1 and 2). It can therefore be
concluded that different 5'-upstream regions (relative to the transcription start sites) of the
tomato Lhc genes are necessary for conferring circadian rhythmicity. The results for all deletion
constructs are summarized in Figure 5.
The promoter deletion analysis indicated regions which are necessary for the transmission of the
circadian clock in transgenic plants. If a conserved 'clock' regulatory element exists, it should
be localized within these respective regions. Therefore the following 5'-upstream sequences were
aligned: Lhc b1*1, -701 to -159; Lhc b1*2, -152 to -1; Lhc a3, -278 to -231; Lhc a4, -119 to -1.
A novel sequence motif appears to be conserved in all four tomato Lhc 5'-upstream regions
(Table 1). The sequence is 'CAANNNNATC'; it is composed of three conserved nucleotides, four
variable nucleotides and three further conserved nucleotides. Identical or very similar motifs
are present in short 5' promoter regions of circadian-controlled Lhc genes of Arabidopsis
thaliana (Lhc b1*1 (cab 2), -111 to -74 [2, 7]; Lhc b1*2 (cab 3), -371 [23]) and Triticum
aestivum (Lhc b1 (cab 1), -211 to -90 [10]). Consistent with the hypothesis that this motif plays
a role in transmitting circadian rhythmicity is its absence in the promoter of the
non-circadian expressed Lhc b gene of Pinus contorta (-1000 to +1 [5]). Also interesting in this
context are the investigations of Wang et al. [30]. Following the phytochrome-regulated light
signal transduction pathway of the A. thaliana Lhc b1*3 gene, a specific protein, CCA1, a
myb-related transcription factor, was identified. Interestingly, the binding site of this
transcription factor overlaps with the motif which we found to be conserved in the 5'-upstream
regions that are necessary for circadian expression of the four tomato Lhc genes. These findings
support the emerging idea that light perception and clock function are closely related, a
hypothesis recently stated by Kay [17]. His conclusion was based on the presence of PAS domains
in clock-related proteins such as PER (period, Drosophila) and WC (white collar, Neurospora) as
well as in photosensory proteins such as phytochrome (PHY, plants) and the bacterial blue-light
receptor (PYP, photoactive yellow protein [20]). Isolation and characterization of the
trans-regulatory factor(s) binding to the novel 'CAANNNNATC' motif are under way and will bring
further knowledge regarding the hypothesis that the signal transduction pathways of light and the
circadian clock cross-talk or converge.
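
A search for the 'CAANNNNATC' motif in a promoter sequence can be sketched as follows. This is only an illustration under stated assumptions: the function names, the decision to scan both strands, and the toy sequences are hypothetical and are not taken from the original analysis or from the Lhc promoter sequences themselves.

```python
import re

# CAA, four variable nucleotides, ATC. The lookahead allows overlapping hits.
MOTIF = re.compile(r"(?=(CAA[ACGT]{4}ATC))")
MOTIF_LENGTH = 10
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of an upper-case DNA string."""
    return seq.translate(_COMPLEMENT)[::-1]


def find_motif(seq):
    """List (strand, 0-based start on the given strand, matched sequence).

    Assumption of this sketch: both strands are scanned, and positions of
    minus-strand hits are converted back to forward-strand coordinates.
    """
    seq = seq.upper()
    hits = [("+", m.start(), m.group(1)) for m in MOTIF.finditer(seq)]
    rc = reverse_complement(seq)
    for m in MOTIF.finditer(rc):
        start = len(seq) - (m.start() + MOTIF_LENGTH)
        hits.append(("-", start, m.group(1)))
    return hits


# Toy sequences, made up for illustration only.
print(find_motif("GGTCAAGCTTATCGGA"))  # [('+', 3, 'CAAGCTTATC')]
print(find_motif("GATAAAATTG"))        # [('-', 0, 'CAATTTTATC')]
```

Because the four central positions are unconstrained, such a pattern will also match by chance in long sequences, so hits are most informative when the search is restricted to the short regions delimited by the deletion analysis.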